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Abstract 

Let G n be the genus of a two-dimensional surface obtained by 
gluing, uniformly at random, the sides of an n-gon. Recently Linial 
and Nowik proved, via an enumerational formula due to Harer and 
Zagier, that the expected value of G n is asymptotic to (n — lnn)/2 
for n — > oo. We prove a local limit theorem for the distribution of 
G n , which implies that G n is asymptotically Gaussian, with mean 
(n — In n) /2 and variance (In n) /4. 

1 Introduction and main result 

In topology, it is traditional to represent a surface by gluing the sides of 
a polygon j6j [7]. The information of which side is glued to which can be 
encoded by a chord diagram. For example, the classical presentation of torus 
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by gluing the opposite sides of a square can be presented by a chord diagram 
with two intersecting chords: 




2? 




Combinatorially, a chord diagram with n chords is the same as a word of 
length 2n where each letter occurs precisely twice; think about the letters 
written at the end of the chords and the word is to be read off the circle 
counterclockwise starting with the point (0, 1). 

Given a chord diagram D, the surfaces represented by it can be recovered 
in the following way. First, attaching a lower semi-sphere to the circle of D 
and thickening the chords of D to narrow bands we obtain a surface with 
boundary: 

fJ\JJ\ 

c 
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Then attaching a disc, face, to each boundary component of that surface we 
will get the desired closed surface. We consider only orientable surfaces here. 
By genus of a chord diagram we understood the genus of this surface. 

In this paper we are interested in the distribution of genus G n of the 
chord diagram chosen uniformly at random from among all (2n — 1)!! such 
diagrams with n chords. For example, for n = 3 the genus G3 takes only two 
values and 1 according to the picture: 



genus 



genus 1 





Our work was inspired by a recent paper of Linial and Nowik [3] who esti- 
mated the expected value of G n : 



This estimate is implied by a harmonic sum expression for E[Gr n ] they derived 
from a Harer-Zagier formula [2] for the bivariate generating function of {c„ i9 }, 
with c U) g being the number of n-chord diagrams of genus g. In principle, the 
Harer-Zagier formula can be used to obtain sharp asymptotics for higher 
order moments of n + 1 — 2G n . The formulas get progressively messier, 
which makes a distributional analysis of G n quite problematic. From two 
first moments we deduce that that the standard deviation of G n is of order 
(Inn) 1 / 2 . Using this simple information and a contour integration formula 
based on the Harer-Zagier formula, we prove a local limit theorem for the 
distribution of G n , i. e. a sharp asymptotic estimate for the numbers c n>s , 
with \g— Ei[G n ] | <C (In n) 7 ^ 10 . As a corollary, G n is shown to be asymptotically 
normal, with mean (n — lnn)/2 and standard deviation Vhin/2. 

The Harer-Zagier formula was discovered in [2] for the purposes of com- 
putation of the Euler characteristic of moduli spaces of complex curves. It 
is tightly related to matrix models of quantum gravity [3]. We recommend a 
remarkable book jl] for an excellent exposition of the Harer-Zagier formula 
and its relation to different of mathematics and physics. 

The topological construction of surface from a chord diagram above gives 
a graph with single vertex and n loops embedded into a surface and dividing 
the surfaces into a number F of cells, faces. Then the Euler characteristic 
of the surface is 1 — n + F. On the other hand, the Euler characteristic of 
an orientable surface of genus G is equal to 2 — 2G. Thus for the number of 
faces F we have F = n — 2G + 1. By our result, with high probability the 
genus G n of a random chord diagram is very close to (n — lnn)/2. Hence the 
number of faces F n typically tends to be very small, of order Inn, relative 
to the number of edges n. Geometrically it means that typically there is at 
least one face with a large number of sides, of order n/lnn. It would be 
interesting to explore the distribution of the number of sides of individual 
faces. For example, is there typically just one face with that many sides, or 
there are several (how many) such faces? Going out on a limb, we conjecture 
that, analogously to cycles of a uniformly random permutation on [n], with 
high probability there exist several faces, with O(n) sides each. 

To conclude, we should mention that the primary focus of Linial and 



3 



Nowik [S] was a random directed n-chord diagram, generating a random 
oriented surface in a different way, for which a counterpart of the Harer- 
Zagier formula is unknown. In that case they used an ingenious combinatorial 
argument to show that 

Tt 

E[G n ] = --6(lnn). 

Acknowledgment. We would like to thank Nathan Linial and Tahl 
Nowik for useful comments on the first version of the paper. 



2 Harer-Zagier formula 

In terms of the distribution {p n ,g '■= ~P{G n = g)}, the Harer-Zagier for- 
mula [2] is equivalent 



i + 25>x + y +1 - 2 »=(^Y 

n,g ^ ' 



(2.1) 



[5]. To illustrate the power of ( 12. ip . let us compute p„ jn /2 = P(G n = n/2), 
which is the probability that the random surface has exactly one face, F n — 1. 
(Of course, p n ,n/2 = for n odd.) It follows from ( 12. ID that 



1 + X 



X 



Hence, for even n. 



Pn,n/2 = [X n+1 ] ^ ~ Pn.n/2 



? n + 1 

j odd J 

Observe that l/(n + 1) is the probability that the uniformly random permu- 
tation u n+ i of [n + 1] is cyclic. More generally, for k > 1, 

2 P(F n = k) = [x n+l ] ^ [ln(l + x) - ln(l - x)f 



\ j odd J 



Et)" < 2 - 2 > 
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Let O a fi denote the total number of permutations of [a] consisting of b odd 
cycles. From (12. 2p and a standard exponential identity 



> — r- O af) = exp 
a! 




0,6 

it follows that 

P(F n = k) = 2^[x^} J2 ^O a , k = 2 fe - 1 |^ j 

_ 2«-i p^ Wn+1 consists of k odd cycles). (2.3) 

We will show that the equation ( 12. ip can also be used to find sharp 
asymptotic expression for the moments E[G*]. However, we will use only the 
first two moments, as obtaining a limiting distribution of G n via the moments 
method appears to be quite problematic. Besides we set our sights higher, 
on a local limit theorem, for which the moments-based techniques are too 
crude in principle. Our main tool is contour integration already implicit in 
the derivation of Theorem 2 in [2] from the equation ( 12. ip . This theorem 
(see also jH Corrolary 3.1.8]) states that 

c =(2n-iyip = ^ m( 1/2 V +1 (24) 

where \t 2g ](f{t)) denotes the coefficient at t 2g in the power series Taylor 
expansion of the function f(t). The contour integration we mentioned is a 
simple consequence of ( \2A\i : 

V ^ I 1 ( t/2 Y" ■!!■ (25) 

Pn ' 9 2m (n - 2g)\{n + 1) J &+ 1 Vtanh(t/2) J ' 1 } 

c 

here C is a simple closed contour surrounding such that all the non-zero 
roots of tanh(i/2) = are outside of C. Later on we will choose C depending 
on n which will allow as to get the desired asymptotics of p n>g . 



3 Asymptotics of p n ^ g 

The formula ( 12. ip is perfectly tailored for asymptotic evaluation of the facto- 
rial moments E[(n + 1 — 2G n )k\ , k > 1, ((m)k '■= m(m — 1) • • • (m — k-\- 1)). 
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Indeed, differentiating ( 12.1 p k times with respect to y, and setting y = 1, we 
get 

2 £ x»«E [(„ + 1 - 2G„) J = 1±£ (in l±f) * . 

So 

E[ (n +l-2 G „) t H [.»+■] ^(.„I±|)\ 

Using an asymptotic formula for [x m ](l — x) _a (ar -^(l - x)- 1 ) 1 , (Flajolet 
and Sedgewick [1], Section VI. 2), it is straightforward to write down a series- 
type asymptotic formula for E[(n + 1 — 2G n )k\- In particular, for k = 1, 

E[n + 1-2G„] =lnn + ln2-r'(l) + 0(ln- 1 n). (3.1) 

Equivalently 

E[G n ] = \ - ^ + 1(1 - In 2 + r (1)) + Otln- 1 n). (3.2) 

This sharp estimate can also be obtained from the harmonic sum-type for- 
mula already obtained in [5]. Analogously, for all k > 1 

E[(n + l-2G n ) fc ] = ln*n + 0(ln* -1 n). (3.3) 

The relation ( 13.31) implies that 

n + 1 - 2G n 

i > X ' 

Inn 

m probability . Moreover, using (13. ip and ( 13. 3 p for k = 2, we see that 

Var[n+ 1 -2G n ] = O(lnn); 
So, by Chebyshev's inequality, 

P{|G„ - E[G n }\ > (hm) 1 / 2 ^} < b (lnn)~ 2s . (3.4) 
More expressively, by (13. 2p . 

Gn = |-^ + o p ((lnn) 1 / 2+e ), Ve>0; (3.5) 
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the o p notation means that the remainder scaled by (lnn) 1 / 24 ^ converges to 
zero in probability. The upshot of (I3.4j) - (j3.5p is that from now we may, and 
will, focus on the generic values g of G n satisfying 
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n 
2 



Inn 



< (Inn 



,V2+e 



(3.6) 

At the risk of belaboring the obvious, the equation ( 13. 4p is equivalent to 

£ p n , g = l-0{{\nn)^). (3.7) 

g meets ( 1 3 . <3 I t 

Armed with 03 .7p . we will determine an asymptotic formula for p n ,gi with 
g in the range (I3.6p . By fl2.5p . 



Pn,, 



2m(n - 2g)\(n + 1) J t 2 a +l \tanh(t/2) 



dt. 



(3.8) 



c 



Since 



tanh(t/2) 



e* - 1 
e* + r 



its roots are 27Tui, v = ±1, ±2, .... Since the integrand in (13. 8 p is odd, we 
seek C symmetric with respect to the origin t = 0. One would normally 
consider a circular contour of radius t = t(n,g), where t(n,g) is an absolute 
minimum point of 



n+l 



t > 0. 



t 2 9 \tanh(t/2)_ 

However, for g in the range f)3.6p . t(n, (?) turns out to be asymptotic to Inn; so 
a circle of that radius would enclose not only t = 0, but also lots of imaginary 
zeroes 2Trui, v ^ 0, of tanh(t/2). That is, this circle is inadmissible. Instead 
we will select as C a thin horizontal rectangular contour; its short vertical 
sides pass through the points t = ±i, with t ~ Inn, and the long horizontal 
sides pass through the points t = ±m, the zeroes of coth(£/2) closest to the 
origin t = 0. 







i 

iri 












-t 







— ► 

t 






—iti 





Observe that we confine ourselves to the same i = i(n) for all g satisfying 
( 15. 6 p . How to choose t ? Our guiding intuition is that, for some g in the 
range (I3.6p . (i, g) is a stationary, saddle-type, point of a logarithmically-sharp 
approximation of f(t,g)/(n — 2g)\. 

Since n — 2g ~ Inn for g in (13. 6p . we have 

(n-2g)\ = y/toc(n-2g) {^-^X ^ (l + O^nf 1 )). (3.9) 

So we define u = 2g, and introduce 

e t 

if (t, u) — (n — u) In h (n + 1) In . . . — u In t 

v ' ' v ; n-w v y tanh(t/2) 

= (n-u)ln— h(n+l)lncoth(i/2) + (ri + l-u)lnt. (3.10) 

n — u 

A stationary point of H(t,u) is a solution of 

H t (t,u)= -(n + l)-\-+ n + 1 ~ U =0, (3.11) 
smh t t 

H u (t,u) = ln(n-it) -Int. (3.12) 
From (I3.12p . t = n — u, and (13. lip becomes 

— — sinht = n + l. (3.13) 

Taking logarithms of both sides of (13 . 1 3j) . we easily obtain 

t = ln(2n)-— ]— + 0(ln" 2 n). (3.14) 
In {2n) 

The corresponding value g is therefore 

u n — i n — In n ^ . . 
»=2= — = ^ + 0(1 »- (315 » 



And g is well within the target range (13. 6p ! 
Using u = n — i and (13.141) , we compute 

e* + 1 

F(t,*u) = t + lnt + (n+ l)ln 



e* - 1 

ln(2n) + lnlnn+ (n + l)(2e- i_ + 0(e" 2i ")) 

= ln(2nlnn) + 0(ln _1 n). (3.16) 



S 



Furthermore, for g = u/2 satisfying (13. 6p . and an intermediate u, 



H(t, u) = H(t, u) + H u (t, u)(u -u) + -H uu (t, u)(u - uf 



H{t,u) - 



2(n-u) 
= H(t,u 



— (u - uf + 0(\u - u| 3 (n - u)~ 2 ) 



2 Inn 



[u-uf + 0((lnn)- 5 ); (3.17) 



here 5 := —3e + l/2>0ife<l/6, which we assume from now on. 
Putting together (13. 6 h and (I3.9p -( I3TT71) . we transform (13. 8 p into 



V 



(Inn 



1/2 



(2nf/H 

c 



cxp 



{u — uf 



2 Inn 

coth(t/2)\ 
coth(t/2) J 



0((\nn)~ s ) 
dt. 



n+1 / 



(3.18) 



Since u = 2g is even, the integrand in (I3.18P is odd, just like the one in 
(13. 8p . Consequently, the contour integral is twice the contour integral over 
C* = Ci U C 2 U C 3 ; here C x = {t = -ttz + x, < x < t}, C 2 = {t = t + iy : 
— 7T < y < it}, and C3 = {t = iri + x : t > x > 0}. Using the main branch of 
In z, i. e. with the cut {z : Im z = 0, Re z < 0}, the integrand in (I3.18P for 
t G C* can be written as t~ 1 e h( - t ' u \ where 

. / x / ,m coth(t/2) , _ ,t 

Let us show that, asymptotically, u can be replaced with u, i. e. the contour 
integral is almost independent of u. 

On C 2 , since i ~ Inn, 



h(t, u) = h{t, u) + (u — u) In 



t + iy 



and 



= h(t,u) + 0(t~ 1 \u-u\) 
= hit, u) + 0((lnn) £ " 1 / 2 ), 

|/i(t,iZ)| < b ne _ *+ (n+1 - n)?' 1 = 0(1). 
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Therefore 



j l e Ht,n) dt = j l e h(t,u) dt + 0((lnn) £ - 3/2 ). 



Ci Ci 

On C3, since t = Inn + 0(1), 



h(t,u) =(n + l)ln 



+ (n + 1 — u) In 



n + 1) In 



e* + 1 
e* - 1 



t 



, , e 21 — 1 . . , x + Z7T . . 

(n + 1) In + (n + 1 - «) In — ^- + 0(1). 



e x + 1 



i 



Consequently 



e x - 1 



7T J 



Re «) = (n + 1) In \- (n + l-u) In 

e x + 1 t 

The first order derivative of the explicit part on the RHS is 



ra + l , , x 

—, h (ti + 1 — u 

sinni 



71+1 1 

x 2 + 7r 2 sinh t 2 



Hence 



Re h(t, u) < {n + 1 - u) In ^ + - - — - + 0(1) 



t — x 
2 

t — x 



+ 0{(n + l- u)/t) + 0(l) 
+ 0(1), 



as 71 + 1 — m = O(lnn). Thus 



and picking 7 e (0, 1), 



M, U ) df 



t£C 3 :t-ti>x>0 
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for all u in question, including u. For t G C3 with x = Re t >t — t' 



Re {h{t, u) — h(t, u)) ={u — u) In 



\Jx 2 + 7T 2 



OfQnn) 1 / 2 ^- 1 ] =0((lnn)" CT ). 



where a := 1/2 — 5 — 7 > 0, if we choose 7 e (0,1/2- e), which we do! In 
that case 



«/ * Jt-t~t 



t€C 3 :t>x>t-t~f 



X 



(3.23) 



0--1 



<b (Inn 

Combining f)3.22p and f)3.23p . we obtain 



Kt,u) dt _ I _ e h(t,u) dt =0 ( e -f + (l nn )--i) 



c 3 



C 3 



(3.24) 



= 0((lnn)" CT - 1 ). 
The same argument yields 

I 1 - [e h ^ -e h ^] dt = O^nn)-^ 1 ) = 0({\nn)~ z / 2+£+ ^) . (3.25) 

Ci 

Combining fl3~T9|) . ( l3T24j) and (l3T25|) . we conclude that 

j- t e h ^ u Ut=2 j- t e h ^ u Ut= J n + 0((lnn)- 3 / 2+£ ^); 



c* 



:=2 



,fc(t,fi) dt 



(3.26) 



(We cannot write I n as the contour integral over the whole C since u may 
not be an (even) integer.) 



The rest is short. Using ( 13. 26ft . we rewrite (13. 181) as follows: 



Pn,, 



(Inn) 1 / 2 
(2yr) 3 / 2 i 

x 



cxp 



(u — u) 
2 Inn 



0((lnn)- 5 ) 



(3.27) 



[j n + 0((lnn)- 3 / 2+ ^)]. 
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Summing this expression over g = u/2 in the range (I3.6P and using (13. 7p . we 

get 



l-0((lnn)- 2£ ) = [l + 0((\nny s )} I 



(2tt) 3 / 2 ? 



u:g=u/2 meets (1 3 ■ 6 p 



where 



R n := (lnn)- 1+£+7 ^exp 



(u — u) 2 



2 Inn 



(u — u) 2 
2 Inn 



0(R n ), (3.28) 



Recall also that 

£ <l/6, 5=l/2-3e, e + 7<l/2. 

Recognizing the sum in the definition of i? n as a Riemann sum for J R e~ x2 ^ 2 dx 
times vrnn, we see that 

R n <b (lnn)" 1/2+£+7 . (3.29) 
Likewise the sum in (I3.28p . (with u running through even integers), equals 

Vlnn 



( [ e- z2 / 2 dz + 0((\nnr^ 2 )) 



V27rln 



n 



[l + O^nnY 1 / 2 )] . (3.30) 



Plugging the estimates (13. 29f) and H3 .301) into (I3.28P we arrive at 



Inn 



[l + 0((lnn)- CT )] 



(3.31) 



where 



a = min{2e, 1/2 - e - 7, 1/2 - 3e}. 



By choosing 7 > sufficiently small, we can get a arbitrarily close, from 
below, to the largest value of min{2e, 1/2 — 3e} = 1/5, which is attained at 
e = 1/10. Combining (13. 2T|) and ( 13 . 3 1 j) . we have proved the following local 
limit theorem. 
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Theorem 3.1. Let c n , g denote the total number of chord diagrams of genus 
g. Then 



(2n-l)!! Pn ' 9 ~ y/27r(\nn)/A 
uniformly for g satisfying 

\g-g\< (Inn) 7 / 10 "", (g = (n - lnn)/2 + 0(1)) 



1 + 0((lnra) 



-1/5+a) 



exp 



(9 ~ 9? 
2(lnn)/4 



(3.32) 



for a > 0, arbitrarily close to zero. As a corollary, G n is asymptotically 
normal with mean {n — lnn)/2 and variance (lnn)/4. 
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